|
|
Accession Number |
TCMCG075C14542 |
gbkey |
CDS |
Protein Id |
XP_007035297.2 |
Location |
complement(join(29734024..29734251,29734364..29734842,29734924..29735074,29735747..29736046,29736316..29736373,29736895..29737038,29737124..29737380,29737698..29737904,29738011..29738184,29738415..29738482,29738945..29739003,29739585..29739660,29739754..29739859,29739957..29740066,29740678..29740842,29741259..29741344,29741640..29741704,29742005..29742052,29742386..29742505,29742603..29742678,29743059..29743156,29743246..29743384,29744035..29744153)) |
Gene |
LOC18603332 |
GeneID |
18603332 |
Organism |
Theobroma cacao |
|
|
Length |
1110aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA341501 |
db_source |
XM_007035235.2
|
Definition |
PREDICTED: DNA mismatch repair protein MSH1, mitochondrial isoform X2 [Theobroma cacao] |
CDS: ATGTACTGGTTAGCAACGCGGAACGCCGTCGTTTCAATCCCTAGATGGCGTTCTCTTGCCCTTCTCCTCCGTTCCCCTCTCAACAAATACGCCTCCTTAAACCCCTCGTCACTTCTACTTGGAAGACAGTTTGGGCAGATACATTGTTTCAAAGATAAGAAGATTTTGAGAGAAACCACCAAATTTACTAGGAAATTTAAGGCACCGAATAGGGCCCTAGATGATAAGGATCTTTCTCACATAATTTGGTGGAAAGAGAGACTGCAGCTGTGTCGGAAACCTTCCACTCTCAATTTGGTTAAGAGGCTTGTGTATAGCAATTTGCTTGGTGTGGATGTTAACCTGAAAAATGGCAGTTTGAAAGAAGGGACACTGAATTGTGAGATTTTGCAGTTCAAGTCAAAGTTTCCACGTGAAGTTTTGCTCTGCAGGGTTGGGGATTTTTATGAAGCCCTCGGAATAGATGCTTGCATTCTTGTTGAATATGCTGGTTTGAATCCTTTTGGTGGTTTGCGTTCAGATAGTATTCCAAGAGCTGGCTGCCCTGTTGTGAATCTTCGCCAAACTTTGGATGACCTAACACGTAATGGTTATTCAGTGTGCATTGTGGAGGAAGTTCAAGGTCCAACACAAGCTCGTTCTCGTAAAGGACGTTTTATATCTGGGCATGCACATCCGGGTAGTCCTTATGTATTTGGACTTGTTGGGGTTGATCATGATCTTGATTTTCCAGAACCAATGCCTGTTGTTGGTATATCTCGTTCAGCAAGGGGATATTGCATAACTCTGGTTTTAGAGACTATGAAGACATATTCTTCAGAGGATGGTCTTACTGAAGAAGCATTGGTAACCAAGCTGCGAATGTGTAGATACCATCACCTGTTTCTGCATTTATCGTTGAGAGACAATGCTTCAGGAACTTGTCGTTGGGGTGAATTTGGTGCAGGAGGCCTGTTGTGGGGAGAATGCACTACCAGACATTTTGAATGGTTTGAAGGCAATCCTGTCACTGAGCTGTTGTATAAGGTAAAGGAGCTTTATGGGCTTGATGATGAGGTTTCTTTCAGAAATGTCACTGTTCCTTCAGAAAGTAGACCCCGTCCTTTACACCTAGGAACAGCAACGCAGATTGGTGCCATCCCAACAGAAGGAATACCTTGTTTATTGAAGGTGCTGCTGCCATCAAATTGCACTGGGCTACCTGCTCTGTATATTAGAGATCTTCTTCTTAATCCTCCTGCTCATGAAATTGCATCTACAATTCAAGCAACTTGCAAACTTATGAGCAGTATCAAATGCTCAATTCCTGAGTTTACTTGTGTCGCATCTGCAAAGCTTGTGAAGCTACTTGAACTAAGGGAGGCCAACCATATTGAGTTTTGTAGAATAAAAAATGTTGTTGATGAAATACTGCACATGCATAGAAGCACGGACCTCAAAGAAATTCTGAAATTATTGATGGATCCTGCATGGGTGGCAACTGGGTTGAAGATTGACTTTGAGACACTGGTTGATGAGTGTGAATGGGTTTCAGAGAGAATTGGTCAAATGATTTTTCTGGATGGTGAAAATGATCAAAAGATAAGTTCTTATGCCAATATTCCTGGTGAATTTTTTGAGGACATGGAATCTTCATGGAAAGGTCGAGTCAAGAAGCTCCATATAGAAGAAGCAGTTGCAGAAGTTGACAGCGCAGCTGAGGCCTTATCTTTAGTGGTTACTGAAGATTTTCTCCCCATTGTCTCAAGAATAAAAGCGACCTCAGCTCCTCTTGGTGGCCCAAAGGGAGAAATATTATATGCTCGAGAGCATGAAGCTGTTTGGTTTAAGGGCAAACGGTTTGCACCAGCTGTATGGGCTGGTACTCCTGGCGAAGAACAAATTAAGCAGCTTAAGCCTGCTTTAGATTCAAAAGGTAGAAAGGTTGGAGAGGAATGGTTTACCACAATGAAGGTGGAGGACGCTTTAACGAGGTACCATGATGCTGGTGGCAAGGCAAAGGCAAGGGTTCTGGAATTGTTAAGAGGACTTTCTGCTGAGTTACAAACTAAGATAAACATCCTTGTCTTTGCTTCTATGCTGCTTGTTATCGCAAAGGCATTGTTTGCTCATGTGAGTGAGGGGAGAAGAAGGAAATGGGTTTTCCCTATACTTACAGGATTCAGTAGTTCAAAGGGTGGAGAATCATTGGATGAAACAAGAGGAATGAAGATAGTTGGTTTGACCCCATATTGGTTTGACGTGTCAGAAGGCTGTGCTGTGCTTAATACAGTTGATATGCAATCGTTATTTATTTTGACAGGACCAAATGGGGGTGGTAAATCAAGTTTGCTTCGATCAATTTGTGCAGCTGCATTACTTGGAATTTGTGGATTTATGGTTCCTGCTGAATCAGCCTTAATTCCTCAATTTGATTCAGTAATGCTTCACATGAAATCATATGATAGCCCAGCTGATGGGAAAAGTTCATTTCAGGTAGAAATGTCAGAGCTCCGATCCATCATTAGTGGAGCCAGTTCAAGGAGTCTTGTGCTTGTAGATGAAATTTGCCGAGGAACAGAAACGGTGAAAGGGACTTGCATTGCTGGTAGCATCGTTGAGACTCTTGATGAAATTGGCTGTCTAGAATATGTTGATGGACAAACAAAACCAACTTGGAAGTTGGTAGATGGGATCTGCAGAGAAAGCCTTGCATTTGAAACAGCAAAGAAGGAAGGAGTTGCTGAGACAATAATACAAAGAGCTGAAGAACTTTATTCATCAGTCAATGCAAAAGAAGTATCTTCAGGAAGATTTAACACACAACTAGCACAGGTTGGTTCTGAAGGAGCCCAACTTCTATCAAATAGGACTCAAGCAGGATCTCTCTGTCATAAGAGAAAGCCAACAAACAGAATGGAAGTCTTACAGAAGGAAGTTGAGAGTGCTGTTACCTTAATTTGTCAGAAGAAGCTAATGGAGCTCTATAAGCAGAGAAACACATTGGAACTTCCAATCTTAAACTCTGTTGCTATTGCTGCTAGGGAACAGCCTCCTCCTTCAACTATAGGTGCTTCTTGCTTGTATGTCATGTTCAGACCTGATAAGAAACTATATATTGGAGAGACGGATGATCTTGATGGTCGAGTTCGTTCTCATCGTTCGAAGGAAGGAATGCAAAATGCAACCTTCCTTTATTTCATTGTTCCAGGGAAGAGTATCGCTCGCCAACTAGAAACTCTCCTGATCAACCAACTCTCAAGTCAAGGCTTCCCACTCACCAATCTGGCCGATGGTAAGCATCAGAATTTTGGCACATCCAGTCTCTCAGTAGGCAGCATAACTGTTGCCTAA |
Protein: MYWLATRNAVVSIPRWRSLALLLRSPLNKYASLNPSSLLLGRQFGQIHCFKDKKILRETTKFTRKFKAPNRALDDKDLSHIIWWKERLQLCRKPSTLNLVKRLVYSNLLGVDVNLKNGSLKEGTLNCEILQFKSKFPREVLLCRVGDFYEALGIDACILVEYAGLNPFGGLRSDSIPRAGCPVVNLRQTLDDLTRNGYSVCIVEEVQGPTQARSRKGRFISGHAHPGSPYVFGLVGVDHDLDFPEPMPVVGISRSARGYCITLVLETMKTYSSEDGLTEEALVTKLRMCRYHHLFLHLSLRDNASGTCRWGEFGAGGLLWGECTTRHFEWFEGNPVTELLYKVKELYGLDDEVSFRNVTVPSESRPRPLHLGTATQIGAIPTEGIPCLLKVLLPSNCTGLPALYIRDLLLNPPAHEIASTIQATCKLMSSIKCSIPEFTCVASAKLVKLLELREANHIEFCRIKNVVDEILHMHRSTDLKEILKLLMDPAWVATGLKIDFETLVDECEWVSERIGQMIFLDGENDQKISSYANIPGEFFEDMESSWKGRVKKLHIEEAVAEVDSAAEALSLVVTEDFLPIVSRIKATSAPLGGPKGEILYAREHEAVWFKGKRFAPAVWAGTPGEEQIKQLKPALDSKGRKVGEEWFTTMKVEDALTRYHDAGGKAKARVLELLRGLSAELQTKINILVFASMLLVIAKALFAHVSEGRRRKWVFPILTGFSSSKGGESLDETRGMKIVGLTPYWFDVSEGCAVLNTVDMQSLFILTGPNGGGKSSLLRSICAAALLGICGFMVPAESALIPQFDSVMLHMKSYDSPADGKSSFQVEMSELRSIISGASSRSLVLVDEICRGTETVKGTCIAGSIVETLDEIGCLEYVDGQTKPTWKLVDGICRESLAFETAKKEGVAETIIQRAEELYSSVNAKEVSSGRFNTQLAQVGSEGAQLLSNRTQAGSLCHKRKPTNRMEVLQKEVESAVTLICQKKLMELYKQRNTLELPILNSVAIAAREQPPPSTIGASCLYVMFRPDKKLYIGETDDLDGRVRSHRSKEGMQNATFLYFIVPGKSIARQLETLLINQLSSQGFPLTNLADGKHQNFGTSSLSVGSITVA |